Acceleration of Multiple Precision Matrix Multiplication Based on Multi-component Floating-Point Arithmetic Using AVX2
نویسندگان
چکیده
In this paper, we report the results obtained from acceleration of multi-binary64-type multiple precision matrix multiplication with AVX2. We target double-double (DD), triple-double (TD), and quad-double (QD) arithmetic designed by certain types error-free transformation (EFT) arithmetic. Furthermore, implement SIMDized EFT functions, which simultaneously compute four binary64 numbers on x86_64 computing environment, using help them, also develop DD, TD, QD additions multiplications. addition, AVX2 load/store functions were adopted to efficiently speed up reading storing elements from/to memory. Owing these combined techniques, our implemented multiplications have been accelerated more than three times compared non-accelerated ones. Our modifies performance parallelization OpenMP.
منابع مشابه
Double precision floating-point arithmetic on FPGAs
We present low cost FPGA floating-point arithmetic circuits for all the common operations, i.e. addition/subtraction, multiplication, division and square root. Such circuits can be extremely useful in the FPGA implementation of complex systems that benefit from the reprogrammability and parallelism of the FPGA device but also require a general purpose arithmetic unit. While previous work has co...
متن کاملPrecision Arithmetic: A New Floating-Point Arithmetic
A new deterministic floating-point arithmetic called precision arithmetic is developed to track precision for arithmetic calculations. It uses a novel rounding scheme to avoid the excessive rounding error propagation of conventional floating-point arithmetic. Unlike interval arithmetic, its uncertainty tracking is based on statistics and the central limit theorem, with a much tighter bounding r...
متن کاملA Fortran Package For Floating-Point Multiple-Precision Arithmetic
FM is a package of Fortran subroutines which performs floating-point multiple-precision arithmetic. Many such packages have been written, and FM provides several improvements to previous packages in the areas of speed, accuracy, exception handling, and maintainability of the code. Brent’s MP package [6] is probably the most widely used of these packages at present, due to its greater functional...
متن کاملAlgorithms for arbitrary precision floating point arithmetic
We present techniques which may be used to perform computations of very high accuracy using only straightforward oating point arithmetic operations of limited precision, and we prove the validity of these techniques under very general hypotheses satissed by most implementations of oating point arithmetic. To illustrate the application of these techniques, we present an algorithm which computes ...
متن کاملFPGA Based Quadruple Precision Floating Point Arithmetic for Scientific Computations
In this project we explore the capability and flexibility of FPGA solutions in a sense to accelerate scientific computing applications which require very high precision arithmetic, based on IEEE 754 standard 128-bit floating-point number representations. Field Programmable Gate Arrays (FPGA) is increasingly being used to design high end computationally intense microprocessors capable of handlin...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Lecture Notes in Computer Science
سال: 2021
ISSN: ['1611-3349', '0302-9743']
DOI: https://doi.org/10.1007/978-3-030-86976-2_14